Semi-automatic, data-driven construction of multimedia ontologies
نویسندگان
چکیده
In this paper we investigate semi-automatic construction of multimedia ontologies using a data-driven approach. We start with a collection of videos for which we wish to build an ontology (an explicit specification of a domain). Each video is pre-processed: scene cut detection, automatic speech recognition (ASR), and metadata extraction are performed. In addition we automatically index the videos based on visual content by extracting syntactic (e.g., color, texture, etc.) and semantic features (e.g., face, landscape, etc.). We then combine standard tools for ontology engineering and tools in content-based retrieval to semi-automatically build ontologies. In the first stage we process the text information available with the videos (ASR, metadata, and annotations, if any). Stop words (e.g., a, on, the) are eliminated and statistics (e.g., frequency, TFIDF, and entropy) are computed for all terms. Based on this data we manually select concepts and relationships to include in the ontology. Then we use content-based retrieval tools to assign multimedia entities (e.g., shots, videos, collections of videos) to concepts, properties, or relationships in the ontology, and to select multimedia entities as concepts, relationships, or properties in the ontology. We explore this methodology to construct multimedia ontologies from 24 hours of educational films from the 1940s-1960s used in the TREC video retrieval benchmark and discuss the problems encountered and future directions.
منابع مشابه
Multimedia Annotation System for Intelligent Search*
In this paper we present an overview of the intelligent multimedia annotation and search system MetaOn. The core objective is to construct and integrate semantically rich metadata, extracted from documents and images, to facilitate intelligent search and analysis. The proposed MetaOn framework involves, ontology-based information extraction and data mining, semi-automatic construction of domain...
متن کاملMOWIS: A System for Building Multimedia Ontologies from Web Information Sources
Defining ontologies within the multimedia domain still remains a challenging task, due to the complexity of multimedia data and the related associated knowledge. In this paper, we propose: i) a novel multimedia ontology model that combine both low level descriptors and high level semantic concepts; ii) an automatic construction of ontologies using the Flickrweb services, that provide images, ta...
متن کاملInduction on the Semantic Web
The Semantic Web is increasingly populated with instance data, nowadays often in the form of Linked Data. Consequently, machine learning and other instance driven approaches are of increasing relevance. In this special issue we have collected various inductive approaches and approaches from relational learning for solving a number of tasks. In particular, inductive methods are applied to learn ...
متن کاملMultimedia Ontology Life Cycle Management with the SALERO Semantic Workbench
Ontologies are gaining increased importance in the area of multimedia retrieval or management as they try to overcome the commonly known drawbacks of existing multimedia metadata standards for the descriptions of the semantics of multimedia content. In order to build and use ontologies, user have to receive appropriate support. This paper presents the SALERO Semantic Workbench which offers a se...
متن کاملModal Keywords, Ontologies, and Reasoning for Video Understanding
We proposed a novel framework for video content understanding that uses rules constructed from knowledge bases and multimedia ontologies. Our framework consists of an expert system that uses a rule-based engine, domain knowledge, visual detectors (for objects and scenes), and metadata (text from automatic speech recognition, related text, etc.). We introduce the idea of modal keywords, which ar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003